DQ Workflows
Benefits
The DQ Workflows package listed on Collibra Marketplace allows you to 1) create and manage Data Quality Issues, 2) receive Notifications on Rule Metrics, and 3) request Rule Creation and Modification within Collibra Data Intelligence Cloud. Data stewards will be able to organize and prioritize all requests within DIC before they take any action within Collibra Data Quality.
Once deployed, the workflows will facilitate quicker data issue remediation by involving business analysts and other personas who can now participate in your data quality workstreams.
Please note: DQ Workflows are listed on Collibra Marketplace and are templates to get customers started. Collibra-provided Marketplace listings are not subject to the same SLA obligations (https://marketplace.collibra.com/marketplace-terms/). In addition, they can only be leveraged within Collibra Data Intelligence Cloud. In the future, we will work towards releasing bi-directional workflows.
Step 0: Prerequisites
Resource | Notes |
---|---|
Collibra Edge Site | DQ Connector is a capability of Edge |
Collibra Data Intelligence Cloud | 2021.07 Release (or newer) |
Collibra Data Quality | 2.15 (or newer) |
Collibra DQ Connector | Synchronized Rules from Data Quality to Catalog |
Note: After gathering all the prerequisites, proceed to the next step.
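The two version requirements in the table above ("2021.07 Release (or newer)" and "2.15 (or newer)") can be checked with a minimal Python sketch. This is only an illustration: the function names are hypothetical, the sketch assumes plain dotted numeric version strings, and how you look up the installed versions in your environment is not covered here.

```python
from __future__ import annotations

# Minimum versions taken from the prerequisites table.
MINIMUMS = {
    "Collibra Data Intelligence Cloud": "2021.07",
    "Collibra Data Quality": "2.15",
}


def parse_version(text: str) -> tuple[int, ...]:
    """Turn '2021.07' into (2021, 7) so parts compare as numbers, not text."""
    return tuple(int(part) for part in text.split("."))


def unmet_prerequisites(installed: dict[str, str]) -> list[str]:
    """Return one message per product that is missing or too old."""
    problems = []
    for product, minimum in MINIMUMS.items():
        current = installed.get(product)
        if current is None:
            problems.append(f"{product}: not found (needs {minimum} or newer)")
        elif parse_version(current) < parse_version(minimum):
            problems.append(f"{product}: {current} is older than {minimum}")
    return problems


# Hypothetical installed versions, for illustration only.
example = {
    "Collibra Data Intelligence Cloud": "2021.10",
    "Collibra Data Quality": "2.9",
}
print(unmet_prerequisites(example))
```

Comparing the parts as integers matters here: as plain text, "2.9" would sort after "2.15", even though 2.9 is the older release.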
Step 1: Download, Deploy and Start DQ Workflows
1A. Download Package from Collibra Marketplace and Unzip Files
1B. Deploy Workflows
1C. Adjust Workflow Settings (One-Time Setup)
Workflow Configuration Setting | DQ Rule / DQ Sync Request | DQ Rule Modification | DQ Data Remediation | DQ Issue Resolution | Manage DQ Subscriptions | Notify of DQ Metrics |
---|---|---|---|---|---|---|
Applies To | Asset | Asset | Asset | Asset | Global | Global |
Applies To Asset Type | Column, Table | Column, Table, Data Quality Rule | Column, Table | Issue | ||
Other: Any Signed In User Can Start Workflow | Y | Y | Y | Y | ||
Other: Perform Candidate User Check on Workflow Start | Y | Y | Y | Y | Y | |
Other: This Workflow Can Only Run Once At Same Time on Specific Resource | Y | Y | ||||
Other: Show In Global Create | Y | Y | ||||
Roles: Start Workflow | Sysadmin | Sysadmin | ||||
Roles: Stop Workflow | Sysadmin | Sysadmin | ||||
Roles: Reassign Tasks | Sysadmin | Sysadmin |
Step 2: Create Data Quality Requests / Issues
2A. Create Data Quality Issues
Workflow | Main Requestor Persona | Description | Steward Taking Action |
---|---|---|---|
DQ Data Remediation | Data Steward, Business Analyst | Tracking and management of confirmed data issues that may require underlying data remediation | Data Lake Admin, ETL Engineer |
DQ Rule Request | Business Analyst | Proposing data quality rules in plain language e.g. "flag any German phone numbers in this dataset" or "identify customers with churn risk based on engagement time with our platform" | Data Steward |
DQ Rule Modification | Business Analyst | Proposing adjustments to existing rules e.g. values, dimensions, passing thresholds | Data Steward |
DQ Sync Request | Business Analyst | Requesting a DQ Connector synchronization and/or the onboarding of a new dataset with pre-populated rules | Data Steward |
Step 3: Manage Data Quality Issues
3A. Set Up Data Helpdesk Filter
Data Helpdesk
- Select Issues
- Navigate to 'Filters'
- Properties > Attributes > Relations > Issue **categorized by** Issue Category > Input 'Data Quality Issue' > Apply
- Save button > Save View as > 'Data Quality Issues'
- Optional settings for View: Can pin, promote, make public, make default
3B. Manage Issues From Data Helpdesk View
3C. Alternate: Manage Issues From Tasks
Step 4: Receive Notifications Of DQ Issues And Metrics
4A. Set Up DQ Metric Subscription
Who? Anyone can set up a DQ subscription, either for themselves or for a teammate.
Alerts are sent when rules and metrics associated with the subscribed Tables or Columns violate the specified Threshold.
Provided an e-mail address is associated with the Subscriber within Collibra, the Subscriber receives e-mail notifications, by default at 12pm local server time. This and other settings of the provided workflow can be adjusted in Eclipse, Collibra's recommended workflow editor.
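The alerting rule described in this step can be pictured with a minimal Python sketch. It is not the code of the shipped workflow. The class names, the field names, and the assumption that a metric "violates" the Threshold when its passing percentage falls below it are all hypothetical and chosen only for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass
from datetime import date
from typing import Optional

# Fixed English day names so the result does not depend on the system locale.
DAY_NAMES = ["Monday", "Tuesday", "Wednesday", "Thursday",
             "Friday", "Saturday", "Sunday"]


@dataclass
class Subscription:  # hypothetical model of a DQ subscription
    title: str
    subscriber_email: Optional[str]  # alerts need an e-mail address
    threshold: float                 # assumed to be a passing percentage
    notification_days: set[str]
    assets: set[str]                 # subscribed Tables or Columns


@dataclass
class RuleMetric:  # hypothetical rule result for one Table or Column
    asset: str
    rule: str
    passing_percent: float


def alerts_for(sub: Subscription, metrics: list[RuleMetric],
               today: date) -> list[RuleMetric]:
    """Return the metrics that would be reported to the Subscriber today."""
    if sub.subscriber_email is None:
        return []  # no e-mail on record, so nothing can be sent
    if DAY_NAMES[today.weekday()] not in sub.notification_days:
        return []  # today is not one of the chosen Notification Days
    return [m for m in metrics
            if m.asset in sub.assets and m.passing_percent < sub.threshold]


sub = Subscription("Customer data", "steward@example.com", 90.0,
                   {"Monday"}, {"customers.phone"})
metrics = [
    RuleMetric("customers.phone", "valid_phone_format", 72.5),
    RuleMetric("customers.phone", "not_null", 95.0),
    RuleMetric("customers.email", "valid_email", 50.0),
]
for metric in alerts_for(sub, metrics, date(2024, 1, 1)):  # a Monday
    print(metric.asset, metric.rule, metric.passing_percent)
```

In this sketch only the first metric is reported: the second passes the Threshold, and the third belongs to a Column that is not part of the subscription.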
4B. Review DQ Metric Alerts
Ensure that the DQ alerts set for you are providing helpful details.
4C. Update Subscription Settings
The Manage DQ Subscriptions modal cycles through every subscription set up for a Subscriber so that you can review each one. For each subscription you can update the Threshold, add or delete Notification Days, add or delete Tables or Columns, rename the Subscription title, Save the new settings, or simply Unsubscribe.
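The edits available in the modal can be summarized as a small Python sketch. It only illustrates the set of choices and is not how the workflow stores subscriptions: `SubscriptionSettings` and `review` are hypothetical names, and returning `None` to represent Unsubscribe is an assumption of this example.

```python
from __future__ import annotations

from dataclasses import dataclass, replace
from typing import Iterable, Optional


@dataclass(frozen=True)
class SubscriptionSettings:  # hypothetical, for illustration only
    title: str
    threshold: float
    notification_days: frozenset[str]
    assets: frozenset[str]  # subscribed Tables or Columns


def review(current: SubscriptionSettings, *,
           title: Optional[str] = None,
           threshold: Optional[float] = None,
           add_days: Iterable[str] = (),
           remove_days: Iterable[str] = (),
           add_assets: Iterable[str] = (),
           remove_assets: Iterable[str] = (),
           unsubscribe: bool = False) -> Optional[SubscriptionSettings]:
    """Return the saved settings, or None if the Subscriber unsubscribes."""
    if unsubscribe:
        return None
    return replace(
        current,
        title=current.title if title is None else title,
        threshold=current.threshold if threshold is None else threshold,
        notification_days=(current.notification_days
                           | frozenset(add_days)) - frozenset(remove_days),
        assets=(current.assets
                | frozenset(add_assets)) - frozenset(remove_assets),
    )


before = SubscriptionSettings("Customer data", 90.0,
                              frozenset({"Monday"}),
                              frozenset({"customers.phone"}))
after = review(before, threshold=80.0, add_days=["Friday"],
               add_assets=["customers.email"])
print(after)
```

Anything not passed to `review` is left unchanged, which mirrors saving the modal after editing only some of the fields.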